Historical Text Processing, Manuscript Grammars, Period Document Analysis, Ancient Formats

From Documents to Dialogue: A step-by-step RAG Journey
dev.to·5h
📊Multi-vector RAG
Offensive OSINT s05e10 - Interactive investigative stories part 1
offensiveosint.io·2d
🌐WARC Forensics
IASC: Interactive Agentic System for ConLangs
arxiv.org·15h
🌳Context free grammars
To MD - Convert PDFs, Word, HTML and more to Markdown
tomd.io·11h
🔄Migration Tools
Show HN: Lore Engine – Turn 10-hour lectures into 2 hours of comprehensive notes
github.com·22h
📄Document Streaming
Take Note: Cyber-Risks With AI Notetakers
darkreading.com·1d
🎫Kerberos Attacks
Extract speaker notes from PowerPoint to text
dri.es·1d
📜Palimpsest Analysis
Cactus Language • Semantics 3
inquiryintoinquiry.com·3h
🔢Denotational Semantics
Algorithmic Archive Project: Use Cases (1/3)
blogs.bodleian.ox.ac.uk·3d
📊Citation Graphs
Show HN: Using an LLM to sensibly sort a shopping receipt
treblig.org·1d
🔗Constraint Handling
Writing regex is pure joy. You can't convince me otherwise.
triangulatedexistence.mataroa.blog·17h
Format Verification
Beyond Vector Search: Building a RAG That *Actually* Understands Your Data
dev.to·1d
🗂️Vector Databases
Computable Babylonian Diaries Project
christopherwolfram.com·4h
📜Digital Philology
OCR vs ADE: Mechanisms Behind the Methods
dev.to·1d
📄OCR
Work in content? You should be using AI for alt text
tk.gg·20h
📄PostScript
Efficient and accurate search in petabase-scale sequence repositories
nature.com·2d
🔄Burrows-Wheeler
Mind the Gap: Quantifying Vocabulary Mismatch in E-Commerce Site Search
searchhub.io·1d
📈Search Quality
[R] A Unified Framework for Continual Semantic Segmentation in 2D and 3D Domains
reddit.com·14h
📝Document Chunking
AI as both authors and reviewers of research papers
openreview.net·23h
🔲Cellular Automata